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Transposons are mobile genetic ele- 
ments that can be harmful for the 
host when mobilized. However, they are 
also genomic reservoirs for novel genes 
that can be evolutionarily beneficial. 
There are many examples of domesticated 
transposases, which play important roles 
in the hosts. In most cases domesticated 
transposases have lost their endonucle- 
ase activities and the hosts utilize their 
DNA-binding properties. However, some 
other domesticated transposases perform 
endonuclease activities for host biological 
processes. Because such a catalytically 
active transposase is potentially harm- 
ful for the integrity of the host genome, 
its activity should be tightly regulated. 
The catalytically active domesticated 
piggyBac transposase Tpb2p catalyzes 
programmed DNA elimination in the 
ciliate Tetrahymena. Here, we discuss 
the regulatory mechanism that prevents 
unintended DNA cleavage by Tpb2p 
and compare it to another well-studied 
catalytically active domesticated trans- 
posase, the RAG recombinase in V(D)J 
recombination. The regulatory mecha- 
nisms involve the temporarily regulated 
expression of the transposases, the target 
sequence preference of the endonucle- 
ase, and the recruitment of the trans- 
posases to locally restricted chromatin 
environments. 

Transposases are enzymes encoded 
by DNA transposons. They catalyze the 
excision of the transposon at one genomic 
locus and its integration at a distant posi- 
tion. Both of these steps can be deleterious 
to the host. The excision step causes DNA 



double-strand breaks that are harmful if 
not repaired properly 1 and their integra- 
tion may disrupt host genes. Nonetheless, 
the harmfulness of transposases is only one 
side of the coin, and it is becoming clear 
that many transposases have been domes- 
ticated for the benefit of their hosts. 2 In 
most cases, the host benefits from the 
DNA- or chromatin-interaction abil- 
ity of the transposase. Examples include 
Daysleeper in Arabidopsis 3 and the wide- 
spread eukaryotic Cenp-b. 4 In contrast, 
there are only a few examples in which 
the domesticated transposase is still cata- 
lytically active. Because the endonuclease 
activity of transposases represents a great 
threat to the host, tight regulatory mech- 
anisms are necessary to prevent unin- 
tended genome rearrangements. The most 
famous example of a domesticated trans- 
posase with catalytic activity is the RAG 
(recombination-activated gene) recombi- 
nase from jawed vertebrates. 5 The RAG 
enzyme complex containing RAG1 and 
RAG2 catalyzes the double-strand break 
formation in V(D)J recombination dur- 
ing lymphocyte development. Recently, 
we and others reported further examples 
of catalytically active transposases from 
the piggyBac family that play essential 
roles in programmed DNA elimination 
in the ciliated protists Tetrahymena and 
Paramecium. 6,7 Here, we discuss how the 
activity of the Tetrahymena piggyBac trans- 
posase 2 (Tpb2p) is regulated to catalyze 
programmed DNA elimination without 
causing harmful genome rearrangement 
events through a comparison of Tpb2p 
and the RAG recombinase. 
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Figure 1. Conjugation and DNA elimination in Tetrahymena. (A) During conjugation, the Mic gives rise to the newMic and the new Mac. DNA elimination 
occurs in the exconjugant, which has 2 Mies and 2 new Macs (middle); red = Mic, purple = Mac. (B) DNA elimination removes lESs from the developing 
Mac genome, and the flanking regions are ligated. (C) DNA elimination occurs in specific nuclear compartments, heterochromatin bodies, which con- 
tain histone H3 K9me3/K27me3, Pddlp, and Tpb2p. We propose that this compartmentalization restricts Tpb2p's action (see text). (D) Localization of 
Tpb2p, Pddlp, and H3K9me3 in heterochromatin bodies, as shown by immunofluorescence staining. Asterisk = new Mac, Arrow = Mic. 



Most ciliates, such as Tetrahymena 
thermophila, have two structurally and 
functionally distinct nuclei in a single 
cell (Fig. 1A). The large, polyploid mac- 
ronucleus (Mac) provides the cell with all 
RNAs necessary for vegetative growth, 
whereas the small, diploid micronucleus 
(Mic) is transcriptionally silenced dur- 
ing vegetative growth. During sexual 
reproduction, the Mic acts as the germ- 
line nucleus and produces both the Mac 
and the Mic of the progeny 8 (Fig. 1A). 
During new Mac development, two types 
of programmed genome rearrangement 
events occur in Tetrahymena. The first 
is chromosome breakage, by which five 
germline-derived chromosomes are frag- 
mented to approximately 200 Mac chro- 
mosomes. 9 The breaks are repaired by de 
novo telomere addition. The second type 
of genome restructuring is the elimination 
of more than 8000 internal eliminated 
sequences (IESs), followed by ligation of 



their flanking sequences (Fig. IB). The 
elimination of IESs occurs reproducibly: 
invariable sets of IESs are eliminated from 
the Mac, and the majority of their bound- 
aries vary by only a few to several base 
pairs. 10 IESs are thought to be transposon 
remnants, with some of them still having 
the potential to be activated. 11 Therefore, 
IES excision is a process that removes 
harmful genetic elements from the tran- 
scriptionally active Mac. Moreover, it 
is essential for the streamlining of the 
somatic genome because it removes some 
IESs from gene-coding regions, creating 
functional genes. 12 

We recently showed that the endo- 
nuclease activity of the domesticated pig- 
gyBac transposase Tpb2p is required for 
IES elimination in Tetrahymena^ which 
strongly suggests that Tpb2p is the enzyme 
responsible for excising IESs. Indeed, our 
in vitro study showed a preferred activity 
of Tpb2p at the boundary sequences of 



well-studied R-IES 13 and other elements 
in vitro (unpublished). Different IESs 
in Tetrahymena do not share any detect- 
able common sequences, neither within 
themselves nor in their flanking regions. 
Therefore, it is expected that, in sharp con- 
trast to canonical DNA transposases, the 
enzyme that excises IESs should have pro- 
miscuous substrate specificity. Consistent 
with this idea, we found that only the 2nd 
and 3rd bases downstream of the cleavage 
site are crucial for the precision of the cleav- 
age at the R-IES boundary both in vitro 
and in vivo. 13 However, this promiscuous 
substrate specificity of Tpb2p raises a ques- 
tion: how is the precise boundary of IESs 
specified? We believe one of the key com- 
ponents to solve this issue is the hetero- 
chromatin formation that occurs at IESs. 

The IES elimination process can be 
divided into four distinct steps: i) small 
RNA-directed heterochromatin forma- 
tion, ii) assembly of heterochromatin 
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Figure 2. Comparison of the regulatory mechanisms of RAG recombinase and Tpb2p. 



bodies (Fig. 1C), iii) DNA excision, and 
iv) ligation of the flanking DNA by a 
non-homologous end-joining (NHEJ) 
pathway. 14 Heterochromatin formation 
includes the de novo establishment of tri- 
methylation on lysine 9 and lysine 27 of 
histone H3 (H3K9me3 and K27me3), 
specifically at IESs 15 - 16 (Fig. 1C). The 
chromodomain protein Pddlp subse- 
quently binds one or both of the histone 
modifications and promotes the assem- 
bly of IESs into sub-nuclear foci (het- 
erochromatin bodies) 17 (Fig. ID). We 
have shown that Tpb2p is required for 
the assembly of heterochromatin bodies 
and DNA excision in Tetrahymena? ,n 
The fact that endonucleolytically inac- 
tive Tpb2p can only promote the former 
step supports the longstanding idea that 
DNA excision occurs in heterochromatin 
bodies. The cysteine-rich domain (CRD) 
of Tpb2p, which might fold into a PHD 
finger, was found to preferentially inter- 
act with peptides having the histone H3 
N-terminal tail sequence when they were 
tri-methylated at either position lysine 9 
(K9me3) or lysine 27 (K27me3) in vitro. 13 
Furthermore, the mutation of two cyste- 
ines in the CRD led to the inability of 
Tpb2p to promote heterochromatin body 
formation in vivo. 13 Therefore, Tpb2p is 
most probably recruited to IESs via inter- 
actions between H3K9/K27me3 and 
CRD, and this interaction is necessary 



for the assembly of heterochromatin bod- 
ies. We speculate that heterochromatin 
formation at IESs restricts the endonu- 
cleoly tic action of Tpb2p in two ways : by 
specifically recruiting Tpb2p to the IESs 
and by preventing Tpb2p from access- 
ing the middle region of IESs. Tpb2p 
may only be able to cleave euchromatic 
regions adjacent to heterochromatinized 
IESs. A potential regulation could also 
be mediated by cis-regulatory sequences 
that are found close to some IESs. 1822 
They might be able to position Tpb2p 
in a reproducible manner by localizing 
nucleosomes at precise positions and/or 
by arranging IESs in distinct chromatin 
compartments. 

Although the interaction of Tpb2p 
with heterochromatin may restrict its 
action at the edges of IESs, the promiscu- 
ous substrate specificity of Tpb2p could 
still potentially cause undesirable DNA 
cleavage at non-target genomic locations. 
We currently do not know how such a 
deleterious action of Tpb2p is inhibited 
in Tetrahymena. To consider how Tpb2p 
might be regulated, it is worth recall- 
ing what is known about another cata- 
lytically active domesticated transposase, 
RAG recombinase. Here, we propose 
a potential regulatory mechanism of 
Tpb2p that was deduced from a com- 
parison of Tpb2p and RAG recombinase 
(Fig. 2). 



First, these potentially harmful 
enzymes are only expressed in a particular 
developmental process with a short time 
window. Tpb2p expression is limited to 
the stage of new Mac development when 
DNA elimination occurs. 7 The RAG pro- 
teins are only expressed during lymphocyte 
development and are quickly degraded 
after productive V(D)J recombination by 
the proteasome. 23,24 Therefore, it would be 
interesting to study whether there is a sim- 
ilar active protein degradation mechanism 
for Tpb2p in Tetrahymena. 

Second, in both DNA elimination 
and V(D)J recombination, the chroma- 
tin environment of the target sequences 
regulates the interaction of the recombi- 
nase with the target sequences. As dis- 
cussed above, Tpb2p is likely tethered to 
IESs through heterochromatin-specific 
histone modifications. In contrast, the 
heterochromatic environment inhibits the 
access of RAG recombinase to recombina- 
tion signal sequences (RSS), 25 a consensus 
sequence at V, D and J genes. The recom- 
binase can access an RSS only when the 
locus is activated for transcription, accom- 
panied by euchromatic modifications, 
including H3K4me3. 26 Moreover, RAG's 
endonuclease is activated upon binding 
H3K4me3. 26 It has been proposed that 
the binding of RAG2 via its PHD finger 
to H3K4me3 releases an auto-inhibition 
of the RAG1 protein. 27 Analogous to 
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this mechanism, the endonuclease activ- 
ity of Tpb2p may be auto-inhibited and 
specifically activated upon its interaction 
with heterochromatin. Alternatively, a 
protection mechanism that repels Tpb2p 
or inhibits its enzymatic activity in non- 
IES (and thus euchromatic) regions could 
be present. Further study is necessary to 
reveal whether there is such an additional 
layer of regulation of Tpb2p's endonucle- 
ase activity. 

Lastly, both DNA elimination and 
V(D)J recombination likely occur in 
specialized nuclear compartments. In 
Tetrahymena, DNA elimination is believed 
to occur in the heterochromatin bodies 13 
and in mammals, V(D)J recombination 
is thought to occur in recombination cen- 
ters. 28 In addition, DNA looping helps to 
bring separated gene segments in closer 
proximity for recombination. 29 This com- 
partmentalization may inhibit the sponta- 
neous action of domesticated transposases 
outside of their targets. As it is not known 
how IESs are assembled into heterochro- 
matin bodies in Tetrahymena, it would 
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